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Abstract 


DLTS  (deep  level  transient  spectroscopy),  PICTS  (photo-induced 
current  transient  spectroscopy),  and  EBICTS  (electron-beam  induced 
current  transient  spectroscopy)  are  experimental  techniques  to  deter¬ 
mine  the  deep  level  spectrum  of  non-ideal  semiconductors.  These  meth¬ 
ods  operate  by  externally  exciting  a  sample  of  the  material,  and  then 
monitoring  the  time  constants  of  the  subsequent  return  to  equilibrium. 
We  analyze  the  conventional  (“rate  window”)  method  of  evaluating  the 
experimental  data  obtained  by  these  techniques,  and  find  it  wanting 
in  many  respects.  A  new  method  (termed  “spectral  analysis”)  is  then 
presented  which  is  superior  both  in  terms  of  accuracy  and  resolution, 
and  which  also  makes  the  deep  level  spectroscopy  techniques  more 
suited  for  numerical  evaluation. 


Introduction 

The  performance  of  radiation  (i.e. ,  laser  or  electron- beam)  controlled 
solid  state  switches  depends  strongly  on  the  properties  of  the  employed 
semiconductor  material.  Recent  investigations  have  shown  that  not 
only  the  ideal  features  of  the  material  are  of  importance,  but  also  the 
non-ideal  characteristics  related  to  the  nature  and  the  concentration 
of  impurities  and  other  crystal  flaws.  By  inducing  additional  discrete 
states  (so-called  deep  levels  or  traps)  into  the  band  structure  of  the 
semiconductor,  one  can  drastically  influence  the  critical  performance 
measures  like  efficiency,  hold-off  voltage,  and  recovery  time  [1],  and  can 
even  open  the  possibility  of  completely  new  switch  concepts  [2], 

In  this  situation,  both  the  analysis  and  the  design  of  solid  state 
switches  require  reliable  information  about  the  location,  the  cross  sec¬ 
tions,  and  the  number  densities  of  the  deep  levels  in  the  employed  mate¬ 
rial.  Several  experimental  methods  have  been  designed  to  obtain  this  in¬ 
formation,  the  most  widely  used  are  known  by  their  acronyms  as  DLTS 
(deep  level  transient  spectroscopy)  [3],  PICTS  (photo-induced  current 
transient  spectroscopy)  [4],  and  EBICTS  (electron- beam  induced  cur¬ 
rent  transient  spectroscopy)  [5].  The  principles  of  these  methods  are 
very  similiar,  they  operate  by  externally  exciting  a  sample  of  the  ma¬ 
terial,  and  analyzing  the  time  constants  of  the  subsequent  return  to 
equilibrium  as  a  function  of  the  temperature. 

The  last  two  methods  are  particularly  similar,  for  they  both  use 
irradiation  as  means  of  excitation,  and  monitor  the  sample  conductance 
as  a  measure  for  the  excitation.  Both  methods  are  especially  suited  for 
the  analysis  of  semi-insulating  semiconductor  materials  which  form  the 
basis  of  all  optically  activated  switch  concepts.  The  principle  set-up  of 
the  methods  is  schematically  depicted  in  Fig.  1. 
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Fig.  1:  Principle  set-up  for  deep-level  spectroscopy,  with  “irradiation 
source”  denoting  either  a  laser  (PICTS)  or  an  electron-beam  (EBICTS). 


A  sample  of  the  material  under  consideration  is  furnished  with  two 
coplanar  Ohmic  contacts  of  a  few  mm  spacing,  and  then  mounted  on 
a  thermally  isolated  holder  within  an  electrically  grounded  cryostat. 
The  temperature  of  the  cryostat  can  be  adjusted  in  the  range  from  77  K 
(liquid  nitrogen  cooling)  to  400  K  (electrical  heating),  it  can  be  directly 
monitored  by  means  of  a  T-type  thermo  element.  Through  an  opening 
in  the  cryostat,  the  device  is  subjected  to  an  irradiation  pulse  (laser 
light  in  the  case  of  PICTS,  energetic  electrons  in  the  case  of  EBICTS) 
which  brings  it  into  a  state  of  high  electronic  excitation;  the  increased 
conductivity  and  its  subsequent  return  to  equilibrium  can  be  monitored 
by  means  of  an  external  voltage  source  and  a  current  viewing  resistor. 
The  signal  is  then  recorded  by  a  transient  digitizer  and  finally  stored  as 
a  numerical  computer  file. 


The  basic  assumption  of  deep  level  spectroscopy  is  that  the  time 
constants  which  govern  the  return  to  equilibrium  contain  essential  infor¬ 
mation  on  the  level  spectrum  of  the  material.  In  general,  the  dynamics 
can  be  quite  complicated  to  analyze,  let  us  thus  focus  on  a  situation 
where  only  one  isolated  electron  trap  of  total  concentration  N  is  active 
in  a  material  with  electron  lifetime  t.  In  this  case,  the  carrier  kinetics 
can  be  described  in  terms  of  two  rate  equations  for  n  and  nj,  the  density 
of  the  electrons  in  the  valence  band  and  in  the  trap,  respectively, 


^  =  K (ennT  -  n(N  -  nT ))  - 
=  K  (n(N  -  nr)  -  ennj)  ■ 


(1) 

(2) 


I(  and  ejy  describe  the  constants  of  electron  capture  and  thermal  emis¬ 
sion  of  the  trap,  they  depend  on  the  cross  section  it,  the  ionization 
energy  A E  and  the  temperature  T.  (Furthermore,  g  is  the  statisti¬ 
cal  weight  of  the  level,  m  the  effective  electron  mass,  k  and  h  denote 
Boltzmann’s  and  Planck’s  constant,  respectively.) 
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To  describe  the  recovery,  the  equations  can  be  even  more  simplified: 
The  term  ~  n(N  —  Nt)  representing  back-trapping  becomes  negligible, 
and  the  term  ^  on  the  left  of  (1)  can  be  dropped  (under  the  assumption 
that  KN  <C  r-1,  i.e.,  that  the  trap  is  dominated  by  other  channels  of 
electron  recombination.)  We  can  then  solve  the  equations  explicitly; 
assuming  that  the  trap  is  filled  at  t  =  0,  the  electron  density  is 

n(f)  =  XtN  exp(— At).  (5) 


(The  sample  current  i  is  proportional  to  this  quantity,  times  a  factor 
which  depends  on  the  geomety  of  the  sample,  the  applied  voltage,  and  on 
the  specific  mobility  of  the  carriers.)  The  time  constant  A  equals  Ken, 
in  terms  of  the  cross  section  and  the  energy  level  of  the  trap  it  is 


A  = 


2  amk2 
n2g?i3 


T2  exp 


(6) 


Under  the  assumption  that  the  deep  level  parameters  contain  no  implicit 
dependence  on  the  temperature  T,  we  can  plot  A/T2  logarithmically  and 
obtain  a  linear  function  in  l/T,  the  so-called  Arrhenius  plot.  The  slope 
of  this  function  is  proportional  to  the  ionization  energy  of  the  deep  level 
A E,  and  the  absolute  term  contains  the  cross  section  a. 


So  far,  the  evaluation  of  the  data  obtained  by  deep  level  spectroscopy 
seems  completely  straightforward.  We  have,  however,  tacitely  glossed 
over  a  problem  that  is  by  no  means  trivial:  How  can  one  determine  the 
rate  constants  A  from  the  actually  observed  current  i(t),  in  particular 
when  there  is  a  chance  of  more  than  one  trap  being  active  in  the  data? 
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The  Rate  Window  Technique 

The  traditional  procedure  of  evaluating  deep  levels  spectroscopy 
data  is  known  as  the  rate-window  technique  [6].  The  method  is  based 
on  the  observation  that  the  difference  of  the  response  current  at  two 
time  points,  Ai  =  i(tj)  —  *(<2)1  is  unequal  from  zero  only  when  the 
time  constant  lies  within  a  certain  window  (see  Fig.  2).  Indeed,  for  a 
single  exponential  mode  of  the  form  i(t)  =  Aexp(— At)  the  difference  as 
a  function  of  A  has  a  maximum  at 


A  _  MWji) 

h  -  h 


(7) 


Correspondingly,  if  the  temperature  of  the  sample  is  scanned  through 
a  certain  interval,  a  peak  in  the  plot  of  Ai  over  T  marks  the  passing  of 
time  constant  though  the  “rate  window”. 


Fig.  2:  Signal  difference  At  as  a  function  of  the  time  constant  of  a 
single-mode  signal  (itself  being  a  function  of  the  sample  temperature). 

To  analyze  the  sensitivity  and  the  resolution  of  the  rate  window 
technique,  it  is  advantageous  to  switch  to  a  different  point  of  view. 
Instead  of  modifying  the  rate  constants  embodied  in  i(f)  (by  changing 
the  temperature  of  the  sample),  we  concentrate  on  a  single  response 
curve  and  evaluate  Ai  as  a  function  of  the  window  interval  [<1 ,  <2]- 
Assuming  that  the  ratio  fi/f2  =  q  is  constant  (=  0.5  in  the  following 
examples),  and  introducing  a  new  independent  variable  A  via  relation 
(7),  we  can  express  the  signal  difference  as 

,ff/ln(l/q)\  .(  ln(l/q)  \ 

A'(A)  =  JlO^)A)-8l(T^jA>  (8) 


Fig.  3:  Signal  difference  Ai  for  a  single-mode  response  i(t)  =  exp (— f), 
as  function  of  the  spectral  parameter  A  (an  independent  variable). 


We  infer  from  the  form  of  expression  (8)  that  a  change  of  the  tem¬ 
poral  scale  of  the  input  signal,  t  — >  at,  corresponds  to  a  transforma¬ 
tion  A  — >  A /a  in  the  function  A(A).  It  is  thus  natural  to  plot  the 
signal  difference  in  logarithmic  coordinates  (where  re-scaling  is  equiv¬ 
alent  to  a  lateral  shift);  we  will  refer  to  such  a  plot  of  Ai  over  ln(A) 
as  the  “rate  window  spectrum”  of  i(t).  For  a  single-mode  response 
i(t)  =  Fexp(-At),  the  spectrum  has  a  single  peak  located  at  A  (the  rate 
constant  of  the  signal),  it  falls  to  zero  both  for  A  — ►  0  and  A  — ►  00. 
Fig.  3  illustrates  this  for  the  normalized  case  I  =  A  =  1,  where  the 
spectrum  follows  (for  r\  =  0.5)  the  explicit  form 

Ai(A)  =  2~x(i-2“J).  (9) 


Because  of  the  linearity  of  definition  (8),  the  spectrum  of  a  sum  of 
exponential  modes  consists  of  the  superposition  of  the  corresponding 
single-mode  responses.  Unfortunately,  however,  the  width  of  the  peaks 
is  relatively  broad,  and  they  can  be  resolved  individually  only  when  the 
time  constants  are  sufficiently  separated  from  each  other.  Figs.  4  and  5 
demonstrate  this  effect  for  two  modes  of  the  same  amplitude,  separated 
by  factors  \il^\  of  5,  10,  100  and  1000,  respectively. 


Time  t 

Fig.  4:  A  signal  i(t)  consisting  of  two  superimposed  mode  of  equal 
amplitude  (A  =  1),  with  Aj  =  1  and  A2  =  5, 10, 100, 1000,  respectively. 
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Fig.  5:  The  rate  window  spectra  for  the  four  different  cases  of  Fig.  4. 

The  figures  demonstrate  that  different  modes  can  only  be  resolved 
when  the  separation  between  the  time  constants  exceeds  a  factor  of  ten. 
For  traps  with  identical  cross  sections,  this  corresponds  to  an  energy 
resolution  of  not  better  than  2 kT  (about  50  meV  at  room  temperature). 
Clearly,  it  is  desirable  to  do  better  than  that. 
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Spectral  Analysis 

Our  alternative  evaluation  method  “spectral  analyis”  is  motivated 
by  the  form  of  the  data  collected  in  deep  level  spectroscopy  experiments. 
Namely,  what  one  actually  obtains  from  a  set-up  like  the  one  depicted 
in  figure  1  is  not  just  two  data  points  4  and  4  (nor  a  continous  curve 
i(t)  as  assumed  for  our  analyis),  but  a  discrete  set  of  digitally  recorded 
values  4  at  equally  spaced  timepoints  4  =  (k  -  1) At,  k  =  1 ...  IV. 
Taking  into  account  that  the  measurement  will  also  be  subject  to  some 
superimposed  noise,  the  data  (of  a  signal  with  M  modes)  has  the  form 


Standard  variational  calculus  allows  us  to  uniquely  determine  the 
minimizing  spectrum  as 


7(A)  = 


N 

y  J^A  exp(-A4)  A  e  S 

k= l 


0 


A£  S, 


(17) 


where  the  Jk  denote  the  solution  of  the  matrix  equation 


M 

ik  =  y  In  exp(  —  Am4)  +  bik-  (10) 

n=i 


1  N 

-jrJk  +  y  MuJi  =  4 


(18) 


The  task  of  any  evaluation  method  is  to  invert  this  relation,  i.e., 
to  determine  the  2 M  mode  parameters  from  the  N  measured  data  4- 
Obviously,  because  of  the  super-imposed  noise,  there  cannot  be  an  exact 
solution  to  that  problem,  the  best  we  can  hope  for  is  an  “optimal  choice” 
based  on  some  appropriate  statistical  procedure.  The  first  approach 
that  comes  to  mind  is  a  least-square  fit,  with  the  7M  and  AM  chosen  such 
that  the  total  squared  deviation  assumes  its  minimum: 

N  M  -  , 

A2  =  y  (y  7>iexp(-Af,4)  -  4j  =  Min.  (11) 

fc= l  (<=i 

For  two  reasons,  however,  this  is  not  the  optimal  approach.  First,  the 
system  of  equations  derived  from  (11)  is  nonlinear  (and  ill-conditioned), 
so  that  a  direct  solution  is  difficult  to  obtain.  And  secondly,  the  number 
of  modes  present  in  the  signal  is  not  known  a  priori,  so  that  M  must 
be  treated  as  an  additional  unknown.  Both  difficulties  can  be  overcome 
if  we  allow  not  only  for  discrete  modes,  but  also  for  a  continuous  mode 
distribution  (a  “spectrum”)  /(A)  by  making  the  ansatz 

too 

4  =  /  I(X)exp(-Xtk)dX  + 6ik.  (12) 

Jo 

Clearly,  it  is  not  possible  to  calculate  the  spectrum  directly  by  means 
of  a  least-square  fit;  the  knowledge  of  a  finite  number  of  data  points 
(4)  is  not  sufficient  to  completely  determine  a  continuous  function  (/). 
Indeed,  there  is  an  infinite  number  of  possible  spectra  that  represent  the 
data  equally  well.  Most  of  these  functions,  however,  are  highly  irregular, 
i.e.,  contain  strong  (even  discontinuous)  fluctuations  and  values  in  the 
negative  range.  We  can  supress  these  unphysical  solutions  -  which  have 
their  origin  in  the  fact  that  the  inverse  of  the  integral-operator  in  (12) 
is  unbounded  and  hence  not  continuous  -  with  the  help  of  a  suitable 
regularization  of  the  problem.  We  proceed  as  follows: 


with  M  given  as 

Mki  =  /Aexp(-A(4-M,))<a.  (19) 

s 

In  these  equations,  the  support  S  is  defined  as  that  subset  of  (0,  oo) 
where  the  exponential  polynomial  is  greater  than  or  equal  to  zero, 

N 

S  =  {a  €  (0,  oo)  IE  JfcA  exp(-A4)  >  oj.  (20) 

k=  1 

For  each  set  of  current  data  4,  equations  (17)  to  (20)  determine  a 
family  of  spectra  /(A),  parametrized  by  the  Lagrangian  multiplier  A. 
For  A  =  0,  the  “smoothest”  solution  is  chosen,  namely  the  one  identical 
to  zero,  and  no  attention  is  paid  to  the  data  constraint  (15).  For  greater 
values  of  A,  the  fit  becomes  increasingly  better,  and  correspondingly  the 
spectrum  becomes  more  and  more  peaked;  generally,  it  holds 

^l|/2||  >  0,  (21) 

AA*<o.  (22) 

In  the  limit  A  — >  oo,  the  spectrum  converges  to  a  sum  of  (5-functions, 
and  one  can  recover  the  “optimal  choice”  for  the  mode  parameters  of 
ansatz  (10):  Denoting  by  Sp  (fi  =  1 . .  .  M)  the  partition  of  the  support 
into  its  compact  subsets,  the  relations  are 

A„  =  lim  f  dX,  (23) 

A-°°  Jsh 

/„  =  lim  [  7(A)  dA.  (24) 

A~+°°  Js„ 


Instead  of  seaching  for  the  “best”  fit,  we  specify  a  certain  quality 
level,  i.e.,  a  maximum  quadratic  deviation  A2,  and  then  determine  the 
“most  physical”  spectrum  that  meets  the  requirement.  In  other  words, 
we  search  for  the  (in  logarithmic  scale)  smoothest  spectrum 

Pll  =  jf  72( A)x  A  Min  (13) 

among  all  positive  trial  functions 

7(A)  >  0,  Ae(0,oo),  (14) 

which  sufficiently  represent  the  data 

y  (  r  7(A)  exp(— A4)  dX  -  iky  <  A2.  (15) 

a=i 

Clearly,  this  task  is  equivalent  to  a  variational  problem  with  con¬ 
straints,  represented  by  the  Lagrangian  function 

1  r°°  f°° 

c  =  2I  /2(A)y +j0  v(m*)dx  (i6) 

+  iA[y(|  7(A)exp(— A4)dA-  4)  ~a2]> 

where  the  function  F(A)  ensures  the  positivity  of  the  solution  and  the 
parameter  A  6  [0,  00)  takes  the  data  condition  into  account. 


Fig.  6:  Family  of  spectra  7(A)  for  the  current  data  given  in  Fig.  7, 
parametrized  by  the  Lagrangian  multiplier  A(0<A<1,1<A<  1010). 

It  is  interesting  to  note  that  this  behavior  (illustrated  by  Fig.  6)  is 
actually  the  basis  of  our  numerical  solution  strategy:  While,  for  any 
given  finite  value,  the  spectral  equations  are  rather  difficult  to  solve,  we 
found  it  very  satisfactory  to  start  at  A  =  0  and  follow  the  trajectory  of 
the  Jk( A)  through  solution  space  (using  Newton’s  scheme). 
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Comparison  of  the  Methods 

Having  outlined  the  principles  both  of  the  rate  window  method  and 
of  spectral  analysis,  we  now  proceed  to  compare  the  techniques  directly. 
First,  it  is  clear  that  our  new  approach  requires  a  considerably  increased 
mathematical  and  numerical  effort  compared  to  the  rate  window  method; 
among  other  things  it  requires  the  repeated  iterative  solution  of  a  cou¬ 
pled  set  of  N  nonlinear  equations,  with  each  iteration  involving  an 
eigenvalue/eigenvector  search  and  the  determination  of  all  positive  real 
zeros  of  a  high-order  exponential  polynomial.  To  demonstrate  that 
this  increased  effort  is  really  justified,  let  us  apply  both  schemes  to  a 
set  of  constructed  data  with  known  characteristics.  Fig.  7  shows  such 
a  set  of  50  data  points,  representing  a  two-mode  spectroscopy  signal 
superimposed  with  some  Gaussian  noise  reflecting  measurement  errors. 
Explicitly,  we  assumed  a  form 

4  =  0.4exp(— 24)  +  1.5exp(-54)  +  64,  (25) 

with  4  equally  spaced  at  intervals  of  At  =  0.1,  4  =  0.1(ib  -  1)  +  0.05, 
and  <54  drawn  from  a  normal  distribution  n(0,0.05)  with  zero  mean 
and  0.05  standard  deviation  (h  relative  error  of  about  2.5%.) 
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Fig.  7 :  Constructed  spectroscopy  signal  as  basis  of  a  method  comparison, 
consisting  of  two  modes  (Ai  =  2,  A2  =  5)  with  superimposed  noise. 

Let  us  first  consider  the  results  of  the  original  rate  window  method. 
Fig.  8  shows  the  spectrum  Ai(A)  as  defined  in  equation  (8),  plotted 
logarithmically  in  three  decades  ranging  from  A  =  0.1  to  A  =  100. 
As  expected  (because  A2/Ai  =  2.5  is  smaller  than  the  minimum  resolu¬ 
tion  of  about  10),  it  is  not  possible  to  separate  the  two  modes  present 
in  the  signal.  Instead,  the  sjmctrum  is  represented  as  one  single  peak 
with  the  maximum  located  A  ss  4.5.  (Graphical  evaluation.) 


Fig.  8:  Rate  window  At  spectrum  for  the  two-mode  signal  diplayed  in 
Fig.  7,  with  the  separate  single-mode  spectra  shown  for  comparison. 


Now  we  turn  to  analyzing  the  results  of  the  spectral  analysis  method. 
Fig.  9  depicts  the  spectrum  /(A)  in  the  range  of  1  to  10,  evaluated  for 
A  =  1010.  (Note  that  the  interval  spans  only  one  decade,  as  opposed  to 
three  in  Fig.  8.)  The  spectrum  clearly  resolves  the  two  modes  present  in 
the  signal;  both  the  rate  constants  A  and  the  amplitudes  I  are  estimated 
with  reasonable  accuracy.  Further  investigations  confirmed  that  the 
error  of  5-10%  is  directly  related  to  the  superimposed  noise,  for  <54  =  0 
we  were  able  to  determine  the  constants  up  to  an  accuracy  of  10'6. 


Rate  Constant  log  (A) 

Fig.  9:  Spectrum  /(A)  of  the  spectral  analysis  method,  with  A  =  1010. 

A  comparison  of  the  two  spectra  clearly  shows  the  superiority  of  the 
spectral  analysis  technique,  both  in  terms  of  resolution  and  accuracy. 
Extended  numerical  experiments  have  shown  that  there  are  no  prin¬ 
ciple  bounds  in  this  respect,  the  achievable  quality  is  only  limited  by 
superimposed  noise  of  the  input  data  (and  the  accuracy  of  the  employed 
floating  point  arithmetic).  At  a  noise  level  of  2.5%,  rate  constants  which 
differed  by  less  than  1.5  could  easily  be  resolved.  This  corresponds  to 
an  energy  resolution  of  better  than  10  meV  at  room  temperature,  an 
500%  improvement  over  the  rate  window  method. 

An  additional  advantage  of  the  new  technique  is  that  it  is  also  better 
adapted  to  the  experimental  set-up  depicted  in  Fig.  1:  The  actually  col¬ 
lected  data  are  a  finite  number  (typically  1024)  of  values  4  sampled  at 
equidistant  times  4i  just  as  required  by  the  spectral  analysis  technique. 
To  use  only  two  of  these  data  points  (as  in  the  traditional  rate  window 
method)  would  correspond  to  wasting  the  bulk  of  the  already  acquired 
information;  to  exploit  the  knowledge  of  the  complete  function  i(t)  (as 
we  did  in  our  analysis)  would  require  the  use  of  potentially  unstable 
methods  of  extrapolation. 

Much  further  effort  is  required  before  our  technique  can  replace  the 
rate  window  method  as  the  standard  tool  of  deep  level  spectroscopy: 
Tests  must  be  implemented  to  decide  whether  the  data  can  be  reason¬ 
ably  represented  by  an  exponential  sum  in  the  first  place.  A  statistical 
analysis  of  the  relation  between  the  quality  of  the  input  and  the  confi¬ 
dence  limits  of  the  output  is  needed,  and  an  interactive  user  interface 
must  be  designed  to  allow  convenient  manipulation  and  evaluation  of 
large  amounts  of  data.  Nonetheless,  we  believe  that  the  demonstrated 
advantages  of  the  new  method  will  make  this  effort  worthwhile. 
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